chore(asm): improve user blocking for django (auth middleware) #12069

christophe-papazian · 2025-01-24T12:37:32Z

This PR improve user blocking on Django by adding the possibility to block a previously authentified user.

Wrap AuthenticationMiddleware.process_request to check at the start of a new request, if an authentified user was already found and run the WAF on it. Ensure this patch is compatible with APM patches of middleware
Ensure the new way of blocking requests does not interfere with the old way on set_user, by allowing set_user blocking to be bypassed. We want to be sure we call the WAF exactly once.
Add support for "_dd.appsec.user.collection_mode" tag
Those changes will be tested and tracked by several system tests:
- tests/appsec/test_automated_user_and_session_tracking.py::Test_Automated_User_Tracking
- tests/appsec/test_automated_user_and_session_tracking.py::Test_Automated_User_Blocking::test_user_blocking_auto

DataDog/system-tests#3935

APPSEC-56505

Checklist

PR author has checked that all the criteria below are met
The PR description includes an overview of the change
The PR description articulates the motivation for the change
The change includes tests OR the PR description describes a testing strategy
The PR description notes risks associated with the change, if any
Newly-added code is easy to change
The change follows the library release note guidelines
The change includes or references documentation updates if necessary
Backport labels are set (if applicable)

Reviewer Checklist

Reviewer has checked that all the criteria below are met
Title is accurate
All changes are related to the pull request's stated goal
Avoids breaking API changes
Testing strategy adequately addresses listed risks
Newly-added code is easy to change
Release note makes sense to a user of the library
If necessary, author has acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment
Backport labels are set in a manner that is consistent with the release branch maintenance policy

…icated users on Django

github-actions · 2025-01-24T12:38:05Z

CODEOWNERS have been resolved as:

ddtrace/appsec/_constants.py                                            @DataDog/asm-python
ddtrace/appsec/_trace_utils.py                                          @DataDog/asm-python
ddtrace/appsec/trace_utils/__init__.py                                  @DataDog/asm-python
ddtrace/contrib/internal/django/patch.py                                @DataDog/apm-core-python @DataDog/apm-idm-python
ddtrace/contrib/internal/trace_utils.py                                 @DataDog/apm-core-python @DataDog/apm-idm-python

…jango_block_already_authentified_user

datadog-dd-trace-py-rkomorn · 2025-01-24T12:54:19Z

Datadog Report

Branch report: christophe-papazian/django_block_already_authentified_user
Commit report: a14acdb
Test service: dd-trace-py

✅ 0 Failed, 130 Passed, 1378 Skipped, 5m 10.31s Total duration (35m 3.89s time saved)

pr-commenter · 2025-01-24T14:24:25Z

Benchmarks

Benchmark execution time: 2025-01-30 14:41:35

Comparing candidate commit a14acdb in PR branch christophe-papazian/django_block_already_authentified_user with baseline commit f73a3fe in branch 3.x-staging.

Found 0 performance improvements and 0 performance regressions! Performance is the same for 382 metrics, 2 unstable metrics.

ddtrace/appsec/_trace_utils.py

…jango_block_already_authentified_user

ddtrace/appsec/_processor.py

## Checklist - [x] PR author has checked that all the criteria below are met - The PR description includes an overview of the change - The PR description articulates the motivation for the change - The change includes tests OR the PR description describes a testing strategy - The PR description notes risks associated with the change, if any - Newly-added code is easy to change - The change follows the [library release note guidelines](https://ddtrace.readthedocs.io/en/stable/releasenotes.html) - The change includes or references documentation updates if necessary - Backport labels are set (if [applicable](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting)) ## Reviewer Checklist - [x] Reviewer has checked that all the criteria below are met - Title is accurate - All changes are related to the pull request's stated goal - Avoids breaking [API](https://ddtrace.readthedocs.io/en/stable/versioning.html#interfaces) changes - Testing strategy adequately addresses listed risks - Newly-added code is easy to change - Release note makes sense to a user of the library - If necessary, author has acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment - Backport labels are set in a manner that is consistent with the [release branch maintenance policy](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting) --------- Co-authored-by: Nicole Cybul <[email protected]>

#12075) We added locking to the memory profiler to address crashes. These locks are mostly "try" locks, meaning we bail out if we can't acquire them right away. This was done defensively to mitigate the possibility of deadlock until we fully understood why the locks are needed and could guarantee their correctness. But as a result of using try locks, the `iter_events` function in particular can fail if the memory profiler lock is contended when it tries to collect profiling events. The function then returns NULL, leading to SystemError exceptions because we don't set an error. Even if we set an error, returning NULL isn't the right thing to do. It'll basically mean we wait until the next profile iteration, still accumulating events in the same buffer, and try again to upload the events. So we're going to get multiple iteration's worth of events. The right thing to do is take the lock unconditionally in `iter_events`. We can allocate the new tracker outside the memory allocation profiler lock so that we don't need to worry about reentrancy/deadlock issues if we start profiling that allocation. Then, the only thing we do under the lock is swap out the global tracker, so it's safe to take the lock unconditionally. Fixes #11831 TODO - regression test?

…le (#12035) ## Motivation Refactors all web server integrations still using `tracer.trace` to instead use `core.context_with_data`. This is in preparation for supporting AWS API Gateway to ensure all web servers share the same code path. ## Checklist - [x] PR author has checked that all the criteria below are met - The PR description includes an overview of the change - The PR description articulates the motivation for the change - The change includes tests OR the PR description describes a testing strategy - The PR description notes risks associated with the change, if any - Newly-added code is easy to change - The change follows the [library release note guidelines](https://ddtrace.readthedocs.io/en/stable/releasenotes.html) - The change includes or references documentation updates if necessary - Backport labels are set (if [applicable](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting)) ## Reviewer Checklist - [x] Reviewer has checked that all the criteria below are met - Title is accurate - All changes are related to the pull request's stated goal - Avoids breaking [API](https://ddtrace.readthedocs.io/en/stable/versioning.html#interfaces) changes - Testing strategy adequately addresses listed risks - Newly-added code is easy to change - Release note makes sense to a user of the library - If necessary, author has acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment - Backport labels are set in a manner that is consistent with the [release branch maintenance policy](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting)

## Checklist - [x] PR author has checked that all the criteria below are met - The PR description includes an overview of the change - The PR description articulates the motivation for the change - The change includes tests OR the PR description describes a testing strategy - The PR description notes risks associated with the change, if any - Newly-added code is easy to change - The change follows the [library release note guidelines](https://ddtrace.readthedocs.io/en/stable/releasenotes.html) - The change includes or references documentation updates if necessary - Backport labels are set (if [applicable](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting)) ## Reviewer Checklist - [x] Reviewer has checked that all the criteria below are met - Title is accurate - All changes are related to the pull request's stated goal - Avoids breaking [API](https://ddtrace.readthedocs.io/en/stable/versioning.html#interfaces) changes - Testing strategy adequately addresses listed risks - Newly-added code is easy to change - Release note makes sense to a user of the library - If necessary, author has acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment - Backport labels are set in a manner that is consistent with the [release branch maintenance policy](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting)

Depending of the timing, libddwaf loading process could create triggers that would create loops in our instrumentation. From what I investigated: - if loaded too early, it could have bad interactions with gevent. - if loaded too late, it could be self instrumented by the tracer, creating a loop, as ctypes is using Popen and subprocess. while keeping the late loading introduced by 2 previous PRs - #11987 - #12013 this PR introduced a mechanism to bypass tracer instrumentation during ctypes loading, to avoid a possible loop that would prevent the WAF to be loaded. ## Checklist - [x] PR author has checked that all the criteria below are met - The PR description includes an overview of the change - The PR description articulates the motivation for the change - The change includes tests OR the PR description describes a testing strategy - The PR description notes risks associated with the change, if any - Newly-added code is easy to change - The change follows the [library release note guidelines](https://ddtrace.readthedocs.io/en/stable/releasenotes.html) - The change includes or references documentation updates if necessary - Backport labels are set (if [applicable](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting)) ## Reviewer Checklist - [x] Reviewer has checked that all the criteria below are met - Title is accurate - All changes are related to the pull request's stated goal - Avoids breaking [API](https://ddtrace.readthedocs.io/en/stable/versioning.html#interfaces) changes - Testing strategy adequately addresses listed risks - Newly-added code is easy to change - Release note makes sense to a user of the library - If necessary, author has acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment - Backport labels are set in a manner that is consistent with the [release branch maintenance policy](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting)

…t config (#12089) This PR makes a change to our shared distributed tracing header injection method to dispatch signals/events instead of relying on the global config settings, which is only modifiable via env vars. This fixes distributed tracing for users that might rely solely on the `LLMObs.enable()` setup config. Programmatic `LLMObs.enable()/disable()` calls do not set the global `config._llmobs_enabled` boolean setting, which is only controlled by the `DD_LLMOBS_ENABLED` env var. This was problematic for users that relied on manual `LLMObs.enable()` setup (i.e. no env vars) because our distributed tracing injection code only checks the global config to inject llmobs parent IDs into request headers. If users manually enabled LLMObs without any env vars, then this would not be reflected in the global config value and thus LLMObs parent IDs would never be injected into the request headers. We can't check directly if LLMObs is enabled in the http injection module because: 1. This would require us to import significant product-specific LLMObs-code into the shared http injector helper module which would impact non-LLMObs users' app performance 2. Circular imports in LLMObs which imports http injector logic to use in its own helpers Instead of doing our check based on the global `config._llmobs_enabled` setting, we now send a tracing event to our shared product listeners, and register a corresponding `LLMObs._inject_llmobs_context()` hook to be called for all inject() calls if LLMObs is enabled (we check the LLMObs instance, not the global config setting value). ~One risk and why I don't like changing global config settings is because this then implies that it is no longer global or tied to an env var (I want to push for env var configuration where possible over manual overriding/enabling). If a global enabled config can be toggled indiscriminately then this could open a can of worms for enabling/disabling logic in our LLMObs service, which isn't really designed to be toggled on/off multiple times in the app's lifespan. However if some users cannot rely on env vars, then I don't see any other solution that does not couple tracer internal code with LLMObs code which is a no-option.~ (UPDATE: we avoided this issue by using signal dispatching) ## Checklist - [x] PR author has checked that all the criteria below are met - The PR description includes an overview of the change - The PR description articulates the motivation for the change - The change includes tests OR the PR description describes a testing strategy - The PR description notes risks associated with the change, if any - Newly-added code is easy to change - The change follows the [library release note guidelines](https://ddtrace.readthedocs.io/en/stable/releasenotes.html) - The change includes or references documentation updates if necessary - Backport labels are set (if [applicable](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting)) ## Reviewer Checklist - [x] Reviewer has checked that all the criteria below are met - Title is accurate - All changes are related to the pull request's stated goal - Avoids breaking [API](https://ddtrace.readthedocs.io/en/stable/versioning.html#interfaces) changes - Testing strategy adequately addresses listed risks - Newly-added code is easy to change - Release note makes sense to a user of the library - If necessary, author has acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment - Backport labels are set in a manner that is consistent with the [release branch maintenance policy](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting)

…jango_block_already_authentified_user

…azian/django_block_already_authentified_user

…ser.collection_mode tags

christophe-papazian added 2 commits January 23, 2025 13:53

add blocking to django auth

8a8add6

add support for AuthentificationMiddleware to detect and block athent…

b27e320

…icated users on Django

christophe-papazian added changelog/no-changelog A changelog entry is not required for this PR. ASM Application Security Monitoring labels Jan 24, 2025

christophe-papazian changed the title ~~add blocking to django auth~~ chore(asm): improve user blocking for django (auth middleware) Jan 24, 2025

christophe-papazian added 3 commits January 24, 2025 13:42

lint

3ee130c

Merge remote-tracking branch 'origin/main' into christophe-papazian/d…

3fd5496

…jango_block_already_authentified_user

ensure authentification

178feb7

christophe-papazian added 4 commits January 24, 2025 14:04

remove additional tracing

cfd1d1e

restore django_auth

b05361f

small restore

be66632

reenable listener

20db595

christophe-papazian added 3 commits January 24, 2025 15:31

keep apm tools

88fa824

revert line

3d5e91a

fix set_user

db21191

datadog-datadog-prod-us1 bot reviewed Jan 24, 2025

View reviewed changes

ddtrace/appsec/_trace_utils.py Outdated Show resolved Hide resolved

christophe-papazian added 4 commits January 24, 2025 16:48

remove debug code

d949d3d

import ddwaf import mechanism

eb3738e

Merge remote-tracking branch 'origin/main' into christophe-papazian/d…

5699920

…jango_block_already_authentified_user

more log for waf initialisation

667be44

datadog-datadog-prod-us1 bot reviewed Jan 27, 2025

View reviewed changes

ddtrace/appsec/_processor.py Outdated Show resolved Hide resolved

christophe-papazian and others added 6 commits January 27, 2025 15:17

bypass Popen instrumentation for waf initialisation

28ff780

bypass subprocess instrumentation for waf initialisation

993afc9

bypass subprocess.wait instrumentation for waf initialisation

09f1ab2

christophe-papazian and others added 7 commits January 28, 2025 14:48

Merge remote-tracking branch 'origin/main' into christophe-papazian/d…

5b98fdf

…jango_block_already_authentified_user

restore

894bb19

remove dispatch set_user

d3d715e

ensure that we don't call the waf twice for the same event

2e20bca

christophe-papazian changed the base branch from main to 3.x-staging January 30, 2025 13:03

christophe-papazian added 6 commits January 30, 2025 14:04

Merge remote-tracking branch 'origin/3.x-staging' into christophe-pap…

8fc13d9

…azian/django_block_already_authentified_user

revert main changes

89baa22

revert main changes

346e026

add appsec.user.id for auto collection on already authenticated and u…

48a53a3

…ser.collection_mode tags

fix span usage for _on_django_process

b47ea60

add collection mode for on_django_process

a14acdb

christophe-papazian marked this pull request as ready for review January 30, 2025 14:33

christophe-papazian requested review from a team as code owners January 30, 2025 14:33

christophe-papazian requested review from erikayasuda and wconti27 January 30, 2025 14:33

christophe-papazian mentioned this pull request Jan 30, 2025

[python] Enable more user event tests DataDog/system-tests#3935

Merged

7 tasks

emmettbutler approved these changes Jan 30, 2025

View reviewed changes

gnufede approved these changes Jan 31, 2025

View reviewed changes

christophe-papazian merged commit 92d8c5f into 3.x-staging Jan 31, 2025
633 of 634 checks passed

christophe-papazian deleted the christophe-papazian/django_block_already_authentified_user branch January 31, 2025 09:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore(asm): improve user blocking for django (auth middleware) #12069

chore(asm): improve user blocking for django (auth middleware) #12069

christophe-papazian commented Jan 24, 2025 •

edited

Loading

github-actions bot commented Jan 24, 2025 •

edited

Loading

datadog-dd-trace-py-rkomorn bot commented Jan 24, 2025 •

edited

Loading

pr-commenter bot commented Jan 24, 2025 •

edited

Loading

chore(asm): improve user blocking for django (auth middleware) #12069

chore(asm): improve user blocking for django (auth middleware) #12069

Conversation

christophe-papazian commented Jan 24, 2025 • edited Loading

Checklist

Reviewer Checklist

github-actions bot commented Jan 24, 2025 • edited Loading

datadog-dd-trace-py-rkomorn bot commented Jan 24, 2025 • edited Loading

Datadog Report

pr-commenter bot commented Jan 24, 2025 • edited Loading

Benchmarks

christophe-papazian commented Jan 24, 2025 •

edited

Loading

github-actions bot commented Jan 24, 2025 •

edited

Loading

datadog-dd-trace-py-rkomorn bot commented Jan 24, 2025 •

edited

Loading

pr-commenter bot commented Jan 24, 2025 •

edited

Loading